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EFFICIENT LOCKING FOR THREAD-SAFE SELF-MODIFYING CODE 

ABSTRACT 

A locking mechanism for use in a multi-thread environment supporting self-modifying code in which 
modifications to the code are made at runtime. The locking mechanism having associated helper 
5 code accessed by a call from the first instruction address in the code block. The helper code 
calculating the binary encoding for the call instruction and using an atomic compare and exchange 
instruction to compare the calculated binary encoding with the actual contents of the first instruction 
address. Where there is a match, a self loop instruction is written to the first instruction address to 
lock the specified code block for subsequent threads. The helper code contains instructions to 
10 resolve the references in the specified block. The last such instruction is an atomic store operation 
to replace the self loop instruction at the first instruction address with the appropriate modified 
instruction. 
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EFFICIENT LOCKING FOR THREAD-SAFE SELF-MODIFYING CODE 

FIELD OF THE INVENTION 

The present invention is directed to an improvement in computing systems and in particular to an 
5 improvement in locking for multiprocessor computer systems supporting thread-safe self-modify ing 
code. 

BACKGROUND OF THE INVENTION 

Executable code for a computer may, as part of its runtime operation, carry out self-modifying 
1 0 operations. One common example of such self-modifying executable code is where executable code 
contains unresolved data or procedure references. This will occur, for example, where Java language 
code is dynamical ly compi led to produce code that calls methods and refers to data that is unresolved 
at compile time. One approach to permit the compiled code to be executed correctly is to resolve 
the previously unresolved references at the time that the code is first executed. The compiler 
1 5 generates executable code for calling a routine, sometimes referred to as helper code, which carries 
out reference resolution at runtime. The helper code is typically provided with the compiler. 

When, during execution, the unresolved reference is reached, the helper code is provided with data 
identifying the reference to be resolved and is then executed. The helper code carries out instructions 
to resolve the previously unresolved method call or data references in the code. The helper code 

20 modifies the executable code by overwriting the unresolved references with the proper values which 
the helper code has determined at execution time. This ensures that the runtime resolution of the 
reference occurs only once. The code generated by such a compiler is referred to as "self-modifying" 
because the code's call to the helper code results in the replacement of a portion of the executable 
code (originally containing the unresolved references) with modified code which has resolved 

25 references. 
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In multiprocessor computers, self- modification of code at execution time may occur for the above, 
or other reasons. Such self-modification may create errors if different threads or processes execute 
a section of code which is in the process of being modified by another process or thread. To prevent 
this potential problem (race condhion), different solutions have been proposed in the prior art. One 
S approach is to implement a global locking arrangement. This requires a process or thread to obtain 
a global lock to modify a given section of code. The use of a global lock prevents multiple threads 
from executing the code to be modified, but there is a significant overhead involved as the processes 
or threads waiting on the global lock are unable to cany out other processing that may be uiu^lated 
to the code being modified. 

10 Other solutions have been devised which rely on local lock arrangements. For example, a byte or 
word may be reserved for each code site being modified. Threads will lock on the byte or word for 
the code site. This approach has a cost in the code space required to implement the specific locks 
for each code site. This approach must also include code to avoid the potential race condition where 
a processor is executing in the area of code being modified at the time that the code is being updated, 

1 5 but before the processor reaches code to determine whether the lock is available or not. 

Another solution is to add control flow to the section of code being modified to protect the code by 
preventing access to the code by other threads until the code has been modified. This approach will 
leave orphaned control flow statements in the executable code even after the modification of the 
code has taken place. 

20 It is also possible to add a level of indirection in the code which calls the routine (the helper code) 
that resolves the reference. A locking mechanism may be more readily implemented for such a 
calling arrangement but the resulting code will be slower than otherwise due to the introduction of 
the additional level of indirection. 

It is therefore desirable to have a mechanism for locking in a multi-processor thread-safe 
25 environment to permit the runtime modification in self-modifying code such that the resulting code 
is efficient and potential race conditions are eliminated. 
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SUMMARY OF THE INVENTION 

According to an aspect of the present invention there is provided an improved locking mechanism 
for multiprocessor computer systems supporting thread-safe self-modifying code. 

According to another aspect of the present invention there is provided a computer program product 
including a computer usable medium having computer readable program code means embodied in 
the medium for defining code to provide a locking mechanism for self-modifying code in a multi- 
thread environment, the self-modifying code including helper code callable to modify instructions 
in a defined block of the self-modifying code, the computer program product including 

computer readable program code means for defining an atomic compare and exchange 
instruction in the locking mechanism, the defined compare and exchange instruction carrying 
out a comparison of an unreserved lock value with a first instruction in the defined block of 
self-modifying code, the defined compare and exchange instruction exchanging the first 
instruction in the defined block of self-modifying code with a self-loop instruction, where 
the comparison shows a match, 

computer readable program code means for defining code to return execution to the first 
instruction in the defined block of self-modifying code where the comparison shows no 
match, and 

computer readable program code means for defining code to permit the remainder of the 
helper code to be executed to carry out modifications in the defined block of self-modifying 
code, including as a last step an atomic store to replace the self-loop instruction with a 
modified instruction, where the comparison shows a match. 

According to another aspect of the present invention there is provided the above computer program 
product, further including computer readable program code means for defining the fu:st instruction 
in the defined block of self-modifying code to be a call instruction to the helper code and for defining 
the unreserved lock value to be calculated in the helper code based on the return call instruction 
address passed to the helper code. 



CA9.2000-0080 



CA. 02346766 2001-05-07 



According to another aspect of the present invention there is provided the above computer program 
product, further including computer readable program code means for defining the first instruction 
in the defined block of self-modifying code to be a call instruction to the helper code and for storing 
the unreserved lock value as a binary encoding of the call instruction available to the helper code. 

5 

According to another aspect of the present invention there is provided the above computer program 
product in which the helper code is loaded at a non-boundary location in memoiy. 

According to another aspect of the present invention there is provided the above computer program 
1 0 product, further including computer readable program code means for defining the first instruction 
in the defmed block of self-modifying code to be an illegal instruction to interact with a defined trap 
handler to pass control to the helper code, and for defining the unreserved lock value to be the binary 
encoding of the illegal instruction. 

1 5 According to another aspect of the present invention there is provided the above computer program 
product in which the helper code replaces unresolved references in the defined block of self- 
modifying code. 



According to another aspect of the present invention diere is provided a method for locking self- 
20 modifying code in a muUi-thread environment, the self-modifying code including helper code 
callable to modify instructions in a defined block of the self-modifying code, the method including 
the steps of: 

defining an atomic compare and exchange instruction in the locking mechanism, 

the defined compare and exchange instruction carrying out a comparison of an 
25 unreserved lock value with a first instruction in the defined block of self-modify ing 

code. 
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the defined compare and exchange instruction exchanging the first instruction in 
the defined block of self-modifying code with a self-loop instruction, where the 
comparison shows a match 

defining code to return execution to the first instruction in the defined block of self- 
5 modifying code where the comparison shows no match, and 

defining code to permit the remainder of the helper code to be executed to carry out 
modifications in the defined block of self-modifying code, including as a last step an atomic 
store to replace the self-loop instruction with a modified instruction, where the comparison 
shows a match. 

10 

According to another aspect of the present invention there is provided the above method, further 
including the steps of generating the first instruction in the defined block of self-modify ing code to 
be a call instruction to the helper code and of defining code for calculating the unreserved lock value 
in the helper code based on the return address passed to the helper code. 

15 

According to another aspect of the present invention there is provided the above method, further 
including the steps of generating the first instruction in the defined block of self-modifying code to 
be a call instruction to the helper code and of defining code for storing the unreserved lock value as 
a binary encoding of the call instruction available to the helper code. 

20 

According to another aspect of the present invention there is provided the above method, further 
including the step of defining code for loading the helper code at a non-boundary location in 
memory. 

25 According to another aspect of the present invention there is provided the above method, further 
including the steps of generating the first instruction in the defined block of self-modifying code to 
be an illegal instruction to interact with a defined trap handler to pass control to the helper code, and 
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of defining code for setting the unreserved lock value to be the binary encoding of the illegal 
instruction. 

According to another aspect of the present invention there is provided the above method, in which 
5 the helper code replaces unresolved references in the defined block of self-modifying code. 

According to another aspect of the present invention there is provided a locking mechanism for self- 
modifying code in a multi-thread computer system, the self-modifying code including helper code 
callable to modify instructions in a defined block of the self-modifying code, the locking mechanism 
10 including an atomic compare and exchange instruction, 

the compare and exchange instruction carrying out a comparison of an unreserved lock value with 
a first instruction in the defined block of self-modifying code, 

the compare and exchange instruction exchanging the first instruction in the defined block of self- 
modifying code v^dth a self-loop instruction, where the comparison shov^ a match, 

15 the locking mechanism including code defined to return execution to the first instruction in the 
defined block of self-modifying code where the comparison shows no match, 
the locking mechanism including code defined to permit the remainder of the helper code to be 
executed to carry out modifications in the defined block of self-modifying code, including as a last 
step an atomic store to replace the self-loop instruction with a modified instruction, where the 

20 comparison shows a match. 

According to another aspect of the present invention there is provided a method for generating 
executable computer code to define a locking mechanism for runtime resolution of unresolved 
references in a specified block of executable code in a multithread environment, the method 
25 including the following steps: 

inserting a call instruction at the first instruction address in the specified block of executable 

code, the call instruction branching to a block of helper code, 

defining a lock mechanism in the helper code by 
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including instructions in the helper code to calculate the binary encoding for the 
call instruction at the first instruction address, 

including an atomic compare and exchange instruction in the helper code, the the 
instruction having arguments defined to be the calculated binary encoding for the 
5 call instruction, the binary encoding for a self loop instruction, and the first 

instruction address, 

including a branch to the furst instruction address in the specified block of 
executable code, the branch being taken when the atomic compare and exchange 
instruction returns no match between the calculated binary encoding for the call 
10 instruction and the contents at the first instruction address, 

defining instructions in the helper code for the resolution of unresolved references in the 
specified block of executable code, the last such instruction being defined to be an atomic 
store instruction to replace the instruction at the first instruction address. 

1 5 According to another aspect of the present invention there is provided a computer program product 
including a computer usable medium having computer readable program code means embodied in 
the medium for carrying out the above method. 

Advantages of the present invention include a locking system that provides local locking, is 
20 efficient in the resulting runtime code space and results in an optimal code sequence in the self- 
modifying code after it has been modified. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a block diagram illustrating sample code sequences for self-modifying code 
25 according to the preferred embodiment. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

Figure 1 shows a block diagram illustrating an example code sequence in which block 1 0 represents 
a sequence of instructions leading up to the sequence to be modified, block 12 represents the 
sequence of instructions to be modified, and block 14 represents instructions following the sequence 
5 to be modified. Instruction 16 within block 10 is shown in Figure I to have the call to the routine 
which modifies block 12 instructions at runtime. Instruction 18 in Figure 1 is the instruction 
following the call instruction of instruction 16. In the preferred embodiment illustrated in Figure 1 , 
the code to modify block 12 instruction is a reference resolving routine shown as helper code 20. 

The locking mechanism of the preferred embodiment uses a compare and exchange instruction. An 
10 example of such an instruction is the cmpxchg instruction found in the instruction set for Intel 
Architecture x86 processors (Intel 80486-compatible processors; the instruction set is sometimes 
referred to as the IA32 instruction set). The instruction is an atomic compare and exchange. The 
cmpxchg instruction takes three arguments, for the purposes of illustration, Rx, Ry, Tl. On 
execution, the value of Rx is compared with the contents of a memory location speciGed by address 
15 Tl. If the value Rx matches the value stored at Tl, then the value Ry replaces the contents of 
memory location Tl , the old value of T I is stored in Ry, and a zero flag is set to on. If the Rx value 
does not match the contents of address T 1 , then Rx is reset to be the contents of Tl and the zero flag 
is turned off. Other atomic compare and exchange instructions analogous to the cmpxchg instruction 
may be used to implement the invention. 

20 The preferred embodiment is described with reference to the ia32 instruction set implemented on a 
machine supporting a runtime stack. 

In the preferred embodiment, the compiler identifies an instruction sequence with unresolved 
references (exemplified as block 1 2 in Figure 1). The compiler inserts a first instruction in block 1 2 
to call helper code 20. This helper code call instruction is inserted at instruction 16 in the example 
25 shown in Figure 1 . As a result, any thread executing the instructions of block 1 2 will first execute 
the helper code call instruction found at instruction 16. Helper code 20 is defined to obtain a lock 
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and to execute only once to modify the code in block 12, as described below. 

In the preferred embodiment, when helper code 20 is called by the helper code call instruction at 
instruction 16, the address of instruction 18 is pushed onto the runtime stack and control is 
transferred to the instructions of helper code 20. The lock mechanism uses the cmpxchg instruction 
5 to modify the helper code call instruction initially found at instruction 16 in block 12. The first 
thread to reach the cmpxchg instruction in helper code 20, by executing the (atomic) cmpxchg 
instruction, carries out a modification of instruction 16. 

The lock mechanism for helper code 20 is implemented by helper code 20 including the cmpxchg 
instruction which in the preferred embodiment is used to effectively determine if the thread 

10 executing the cmpxchg instruction is the first one to reach that point in the code. This is 
accomplished by helper code 20 including instructions to calculate the bytes representing the helper 
code call instruction that was initially emitted by the compiler and loaded into instruction 16. This 
calculated helper code call instruction bit encoding is then compared with the actual contents of 
instruction 16 using the atomic cmpxchg instruction. A match of the calculated bit encoding for the 

15 helper code call instruction, and the actual contents of instruction 16 signifies that the current thread 
is the first one to reach the cmpxchg instruction in helper code 20. 

If the compare does indicate a match, the instructions in helper code 20 carry out steps having two 
consequences: the helper code call instruction at instruction 1 6 is replaced by a spin loop instruction, 
and the remainder of helper code 20 will be executed by the thread. Because the replacement of the 

20 helper code call instruction at instruction 16 is carried out as part of the atomic compare and 
exchange instruction, the first thread to reach the cmpxchg instruction in helper code 20 will be able 
to execute helper code 20, but later threads will not. Expressed in terms of the lock mechanism, the 
first thread to execute the cmpxchg instruction will hold the lock for helper code 20. The 
mechanism of the preferred embodiment for achieving the lock on helper code 20 is set out in further 

25 detail below. 

In the preferred embodiment, helper code 20 includes code for calculating the address of instruction 
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i 6. This is accomplished by helper code 20 popping the return address for the call instruction off 
the stack. Because the first instruction in block 1 2 is defined by the compiler to be a call instruction 
(to the helper code), the address for instruction 1 6 is computed in helper code 20 by subtracting the 
length of the call instruction from the return instruction address (the address of instruction 1 8) 
5 popped from the stack. 

The relative immediate offset for helper code 20 from instruction 1 6 is then calculated by subtracting 
the return address popped from the stack from the address from the start of helper code 20. This 
calculated relative immediate offset is combined with the code for a call instruction to provide the 
calculated bit encoding for the call instruction for helper code 20 as it would be emitted at the 
1 0 location of instruction 16. In the preferred embodiment, the calculated call instruction is generated 
by shifting the relative immediate offset by 8 bits and loading the opcode for the call instruction 
(0xe8) into the low order byte. The resuh is the first 4 bytes of the binary encoding for the helper 
code call instruction originally generated for the call at instruction 16 by the compiler. 

The compare and exchange instruction in helper code 20 is then given the following three arguments: 

1 5 the calculated call instruction binary encoding (as defined above), 

the self-loop instruction 2 bit binary encoding <jmp $-2 or Oxfeeb), and 

the address of the original call instruction (instruction 16 in Figure 1). 

When the thread reaching the compare and exchange instruction is the first thread to do so, the 
compare and exchange is carried out by a comparison of the calculated call instruction with the value 
20 at the address of the original call instruction. The result is a match and therefore the value of the self 
loop code (jmp$-2) is copied to the location of the original call instruction (instruction 16 in Figure 
1), as part of the atomic compare and exchange. 

Should a second thread reach instruction 16 after the compare and replace has been executed, that 
second thread will sit in the self loop spin until the instruction is altered, as described below. 
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If a second thread reaches instruction 16 before the compare and replace has been executed, that 
second thread will branch to helper code 20. When the second thread executes the compare and 
exchange, instruction 16 will have already been modified by the first thread (because the instruction 
is atomic, the first thread to reach it will always have replaced the call initially at instruction 16 with 
5 the self loop instruction). Helper code 20 is defined such that when the compare and exchange fails 
(the zero flag is set to off), control is returned to instruction 1 6. Where instruction 1 6 is the self loop 
instruction the second thread will spin on the instruction until that instruction is replaced as set out 
below. Once the modification of instruction 1 6 by helper code 20 occurrs, a second thread executing 
at instruction 16 will continue by executing the modified instruction at instruction 16 and then 
1 0 continue with the other instructions in (modified) block 12. 

Where a first thread acquires the lock and executes helper code 20, the modifications required to 
resolve the unresolved references in block 1 2 are carried out. The last such modification is defined 
to be the modification to the instruction to be located at instruction 16. The self loop instruction is 
replaced with the appropriate modified instruction using an atomic store operation. Any threads 
1 5 which had been spiiming on the self loop instruction then execute the modified code in block 1 2 and 
continue with execution of block 14 code. 

As will be apparent from the above, in the preferred embodiment, helper code 20 includes code that 
is used to implement the lock described above and also includes code that resolves references and 
which is used to modify the code in block 12. The mechanism of the preferred embodiment ensures 

20 that the code to modify block 12 is only executed once for each reference that is required to be 
resolved. By using the compare and exchange instruction in helper code 20, the preferred 
embodiment mechanism requires little code space to carry out the locking mechanism. Because the 
helper code call instruction (initially located at instruction 16 in Figure 1) is ultimately replaced by 
modified code, any thread accessing block 1 2 (after the self loop instruction has been replaced) will 

25 be executing optimal code. 

In the preferred embodiment, the helper code call instruction is 5 bytes in length. However, the 
cmpxchg instruction compares a two-byte word. For this reason, it is possible that the cmpxchg 

CA9-2000-0080 1 1 
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instruction will find a match between an already modified call instruction at instruction 16, and the 
calculated helper code call instruction. If this is the case, it is possible for the helper code 20 to be 
executed more than once. This is possible because a second or subsequent process may execute the 
helper code call instruction before the execution of helper code 20 replaces that instruction with the 
5 resolved reference. The cmpxchg instruction will be potentially executed more than once and the 
later executions will show a match between the calculated value and the actual value in instruction 
16. The result will be that the helper code will be executed when the lock mechanism ought to 
indicate that the lock is unavailable. The result is that such processes will replace the instruction 16 
instruction with the resolved reference when the replacement may have already occurred. 

10 In the preferred embodiment, this potential unnecessary execution of helper code 20 is avoided by 
using the align instruction to force helper code 20 to be located at a non-boundary position in the 
computer memory. The compiler of the preferred embodiment generates code that has executable 
code for procedures aligned to start on boundaries in the computer memory. By forcing helper code 
20 to be offset relative to the boundaries, the helper code call instruction will not have the same 2 

1 5 low order bytes as a call instruction that may be generated by helper code 20 and written into 
instruction 16. 

In the preferred embodiment described above, the instruction originally written to instruction 16 by 
the compiler is a call to helper code 20. However, an alternative implementation of the preferred 
embodiment is to use an illegal instruction rather than a call instruction as the original instruction 

20 at instruction 16. In such a case, the transfer of control resulting fi-om the execution of the illegal 
instruction includes a check to determine whether the illegal instruction is used to branch to helper 
code, or is a true error. If it is the former, then the mechanism set out above is used to carry out the 
modification. In this alternative implementation, the trap handler will contain code to call helper 
code 20 and to provide the information to helper code 20 to permit the resolution of the reference 

25 under consideration. In this case an exception record which includes the address of instruction 16 
will be generated by the system and made available to the trap handler. 

Although a preferred embodiment of the present invention has been described here in detail, it will 
CA9-2000-0080 12 
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be appreciated by those skilled in the art that other variations may be made. For example, rather than 
including code in helper code 20 to calculate the binary encoding for the helper code call instruction, 
a variation is to store a copy of that value in memory accessible by helper code 20, Such an approach 
includes a cost inherent in the storage of the binary encoding but will permit the locking mechanism 
of the invention to be achieved. This and other variations may be made without departing from the 
spirit of the invention or the scope of the appended claims. 
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THE EMBODIMENTS OF THE INVENTION IN WHICH AN EXCLUSIVE PROPERTY OR 
PRIVILEGE ARE CLAIMED ARE DEFINED AS FOLLOWS: 

L A computer program product comprising a computer usable medium tangibly embodying 
computer readable program code for defining code to provide a locking mechanism for self- 
5 modifying code in a multi-thread environment, the self-modifying code comprising helper code 
callable to modify instructions in a defined block of the self-modifying code, said computer program 
product comprising: 

computer readable program code means for defining an atomic compare and exchange 
instruction in the locking mechanism, 

10 the defined atomic compare and exchange instruction for carrying out a 

comparison of an unreserved lock value with a first instruction in the defined block of self-modify ing 
code, 

the defined atomic compare and exchange instruction for exchanging the first 
instruction in the defined block of self-modifying code with a self-loop instruction where the 
1 5 comparison indicates that the unreserved lock value matches the first instruction in the defined block 
of self-modifying code; 

computer readable program code means for defining code to return execution to the first 
instruction in the defined block of self-modifying code where the comparison indicates that the 
unreserved lock value does not match the first mstruction in the defined block of self-modifying 
20 code; 

computer readable program code means for defining code to pemait the remainder of the 
helper code to be executed to carry out modifications in the defined block of self-modifying code, 
including as a last step an atomic store to replace the self-loop instruction with a modified 
instruction, where the comparison indicates that the unreserved lock value matches the first 
25 instruction in the defined block of self-modifying code; 
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2. The computer program product of claim 1 further comprising computer readable program code 
means for defining the first instruction in the defined block of self-modiiying code to be a call 
instruction to the helper code and for defining the unreserved lock value to be calculated in the 
helper code based on a return call instruction address passed to the helper code. 

5 

3. The computer program product of claim 1 further comprising computer readable program code 
means for defining the first instruction in the defined block of self-modifying code to be a call 
instruction to the helper code and for storing the unreserved lock value as a binary encoding of the 
call instruction available to the helper code. 

10 

4. The computer program product of claim 2 or 3 in which the helper code is loaded at a non- 
boundary position in memory. 

5. The computer program product of claim 1 further comprising computer readable program code 
15 means for defining the first instruction in the defined block of self-modifying code to be an illegal 

instruction to interact with a defined trap handler to pass control to the helper code, and for defining 
the um^served lock value to be the binary encoding of the illegal instruction. 

6. The computer program product of claim I in which the helper code replaces unresolved references 
20 in the defined block of self-modifying code. 

7. A computer program product comprising a computer usable medium tangibly embodying 
computer readable program code means for defining code to provide a locking mechanism for self- 
modifying code in a muki-thread environment, the self-modifying code comprising helper code 

25 callable to resolve unresolved references in a defined block of the self-modifying code, the helper 
code loaded at a non-boundary position in memory, said computer program product comprising: 

computer readable program code means for defming an atomic compare and exchange 
instruction in the locking mechanism. 
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the defined atomic compare and exchange instruction for carrying out a 
comparison of an unreserved lock value wdth a first instruction in the defined block of self-modify ing 
code, 

the defined atomic compare and exchange instruction for exchanging the 
5 first instruction in the defined block of self-modifying code with a self-loop instruction v^rhere the 
comparison indicates that the unreserved lock value matches the first instruction in the defined block 
of self-modifying code; 

computer readable program code means for defining code to return execution to the 
first instruction in the defined block of self-modifying code where the comparison indicates that the 
10 unreserved lock value does not match the first instruction in the defined block of self-modifymg 
code; 

computer readable program code means for defining code to permit the remainder of 
the helper code to be executed to resolve references in the defined block of self-modifying code, 
including as a last step an atomic store to replace the self-loop instruction with a modified instruction 
15 where the comparison indicates that the unreserved lock value matches the first instruction in the 
defined block of self-modifying code; and 

computer readable program code means for defining the first instruction in the defined 
block of self-modify ing code to be a call instruction to the helper code and the imreserved lock value 
20 is calculated in the helper code based on a return address passed to the helper code. 

8, A method for locking self-modify ing code in a multi-thread environment, the self-modifying code 
comprising helper code callable to modify instructions in a defined block of the self-modifying code, 
the method comprising the steps of: 

25 defining an atomic compare and exchange instruction in the locking mechanism, 

the defined atomic compare and exchange instruction carrying out a 
comparison of an unreserved lock value with a first instruction in the defined block of self-modifying 
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code, 

the defined atomic compare and exchange instruction exchanging the first 
instruction in the defined block of self-modifying code with a self-loop instruction where the 
comparison indicates that the unreserved lock value matches the first instruction in the defined block 
5 of self-modifying code; 

defining code to return execution to the first instruction in the defined block of self- 
modifying code where the comparison indicates that the unreserved lock value does not match the 
first instruction in the defined block of self-modifying code; and 

defining code to permit the remainder of the helper code to be executed to carry out 
10 modifications in the defined block of self-modifying code, including as a last step an atomic store 
to replace the self-loop instruction with a modified instruction where the comparison indicates that 
the unreserved lock value matches the first instruction in the defined block of self-modifying code 

15 9. The method of claim 8 fiuther comprising the steps of generating the first instruction in the 
defined block of self-modifying code to be a call instruction to the helper code and of defining code 
for calculating the unreserved lock value in the helper code based on a return address passed to the 
helper code. 

20 10. The method of claim 8 fiirther comprising the steps of generating the first instruction in the 

defined block of self-modifying code to be a call instruction to the helper code and of defining code 
for storing the unreserved lock value as a binary encoding of the call instruction available to the 
helper code. 

25 11. The method of claims 9 or 1 0 fiirther comprising the step of defining code for loading the helper 

code at a non-boundary position in memory. 
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12. The method of claim 8 further comprising the steps of generating the first instruction in the 
defined block of self-modifying code to be an illegal instruction to interact with a defined trap 
handler to pass control to the helper code, and of defining code for setting the unreserved lock value 
to be the binary encoding of the illegal instruction. 

13. The method of claim 8 in which the helper code replaces unresolved references in the defined 
block of self-modifying code. 

14. A locking mechanism for self-modifying code in a muki-thread computer system, the self- 
modifying code comprising helper code callable to modify instructions in a defined block of the self- 
modifying code, the locking mechanism comprising: 

an atomic compare and exchange instruction, 

the atomic compare and exchange instruction for canying out a comparison of an 
unreserved lock value with a first instruction in the defined block of self-modifying code, 

the atomic compare and exchange instruction for exchanging the first instruction in 
the defined block of self-modifying code with a self-loop instruction where the comparison indicates 
that the unreserved lock value matches the first instruction in the defined block of self-modifying 
code; 

the locking mechanism including code defmed to return execution to the first instruction 
in the defined block of self-modifying code where the comparison indicates that the unreserved lock 
value does not match the first instruction in the defined block of self-modifying code; 

the locking mechanism including code defined to permit the remainder of the helper code 
to be executed to carry out modifications in the defined block of self-modifying code, including as 
a last step an atomic store to replace the self-loop instruction with a modified instruction where the 
comparison indicates that the unreserved lock value matches the first instruction in the defined block 
of self-modifying code. 
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15. The locking mechanism of claim 14 in which the first instruction in the defined block of self- 
modifying code is a call instruction to the helper code and the unreserved lock value is calculated 
in the helper code based on a return address passed to the helper code. 

5 16. The locking mechanism of claim 14 m which the first instruction in the defined block of self- 
modifying code is a call instruction to the helper code and the unreserved lock value is a stored 
binary encoding of the call instruction available to the helper code. 

17. The locking mechanism of claim 1 5 or 1 6 in which the helper code is stored at a non-boundary 
10 position in memory. 

18. The locking mechanism of claim 14 in which the first instruction in the defined block of self- 
modifying code is an illegal instruction defined to interact with a defined trap handler to pass control 
to the helper code, and in which the unreserved lock value is the binary encoding of the illegal 

1 5 instruction, 

19. The locking mechanism of claim 14 in which the helper code replaces unresolved references in 
the defined block of self-modifying code. 

20 20. A method for generating executable computer code to define a locking mechanism for runtime 
resolution of unresolved references in a specified block of executable code in a multithread 
environment, the method comprising the steps of: 

a) inserting a call instruction at a first instruction address in the specified block of 
executable code, the call instruction branching to a block of helper code; 

25 b) defining the lock mechanism in the helper code by: 

i) including instructions in the helper code to calculate a binary encoding 
for the call instruction at the first instruction address, 

ii) including an atomic compare and exchange instruction in the helper 
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code, said included instruction having arguments defined to be a calculated binary encoding for the 
call instruction, the calculated binary encoding for a self loop instruction, and the first instruction 
address, 

iii) including a branch to the first instruction address in the specified 
block of executable code, the branch being taken when the included atomic compare and exchange 
instruction identifies that the calculated binaiy encoding for the call instruction does not match 
contents at the first instruction address; 

c) defining instructions in the helper code for the resolution of unresolved references 
in the specified block of executable code, the last such instruction being defined to be and an atomic 
store instruction to replace the instruction at the first instruction address. 

21. A computer program product comprising a computer usable medium tangibly embodying 
computer readable program code means for carrying out the method of claim 20. 
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